FASTER TRANSFORMS USING EARLY 
ABORTS AND PRECISION REFINEMENTS 

CROSS-REFERENCE TO RELATED APPLICATION 
5 This application is related to the following co-pending and commonly- 

assigned patent applications, which are hereby incorporated herein by reference in 
their respective entirety: 

"FASTER DISCRETE COSINE TRANSFORMS USING SCALED TERMS" to 
Brady et al., having attorney docket no. BLD9-2000-0056US1 . 
1 0 "FASTER TRANSFORMS USING SCALED TERMS" to Trelewicz et al., 

having attorney docket no. BLD9-2000-0059US1. 

BACKGROUND OF THE INVENTION 



15 1. Field of the Invention . 

This invention relates in general to data processing, and more particularly to 
faster transforms that use early aborts and precision refinements. 



2. Description of Related Art . 
20 Transforms, which take data from one domain (e.g., sampled data) to another 

(e.g., frequency space), are used in many signal and/or image processing 
applications. Such transforms are used for a variety of applications, including, but 
not limited to data analysis, feature identification and/or extraction, signal correlation, 
data compression, or data embedding. Many of these transforms require efficient 
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implementation for real-time and/or fast execution where compression may or may 
not be used. 

Data compression is desirable in many data handling processes, where too 
much data is present for practical applications using the data. Commonly, 
compression is used in communication links, to reduce transmission time or required 
bandwidth. Similarly, compression is preferred in image storage systems, including 
digital printers and copiers, where "pages" of a document to be printed may be 
stored temporarily in memory. Here the amount of media space on which the image 
data is stored can be substantially reduced with compression. Generally speaking, 
scanned images, i.e., electronic representations of hard copy documents, are often 
large, and thus make desirable candidates for compression. 

In data processing, data is typically represented as a sampled discrete 
function. The discrete representation is either made deterministically or statistically. 
In a deterministic representation, the point properties of the data are considered, 
whereas, in a statistical representation, the average properties of the data are 
specified. In particular examples referred to herein, the terms images and image 
processing will be used. However, those skilled in the art will recognize that the 
present invention is not meant to be limited to processing images but is applicable to 
processing different data, such as audio data, scientific data, image data, etc. 

In a digital image processing system, digital image signals are formed by first 
dividing a two-dimensional image into a grid. Each picture element, or pixel, in the grid 
has associated therewith a number of visual characteristics, such as brightness and 
color. These characteristics are converted into numeric form. The digital image signal 
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is then formed by assembling the numbers associated with each pixel in the image into 
a sequence which can be interpreted by a receiver of the digital image signal. 

Signal and image processing frequently require converting the input data into 
transform coefficients for the purposes of analysis. Often only a quantized version of 
5 the coefficients is needed (e.g. JPEG/MPEG data compression or audio/voice 

compression). Many such applications need to be done fast in real time such as the 
generation of JPEG data for high speed printers. 

Pressure is on the data signal processing industry to find the fastest method 
by which to most effectively and quickly perform the digital signal processing. As in 

10 the field of compression generally, research is highly active and competitive in the 
field of fast transform implementation. Researchers have made a wide variety of 
attempts to exploit the strengths of the hardware intended to implement the 
transforms by exploiting properties found in the transform and inverse transform. 
One such technique is the ISO 10918-1 JPEG International Standard /ITU-T 

15 Recommendation T.81 . The draft JPEG standard is reproduced in Pennebaker and 
Mitchell, JPEG: Still Image Data Compression Standard, New York, Van Nostrand 
Reinhold, 1993, incorporated herein by reference. One compression method 
defined in the JPEG standard, as well as other emerging compression standards, is 
discrete cosine transform (DCT) coding. Images compressed using DCT coding are 

20 decompressed using an inverse transform known as the inverse DCT (I DCT). An 
excellent general reference on DCTs is Rao and Yip, Discrete Cosine Transform, 
New York, Academic Press, 1990, incorporated herein by reference. It will be 
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assumed that those of ordinary skill in this art are familiar with the contents of the 
above-referenced books. 

It is readily apparent that if still images present storage problems for computer 
users and others, motion picture storage problems are far more severe, because 
full-motion video may require up to 60 images for each second of displayed motion 
pictures. Therefore, motion picture compression techniques have been the subject 
of yet further development and standardization activity. Two important standards 
are ISO 11172 MPEG International Standard and ITU-T Recommendation H.261. 
Both of these standards rely in part on DCT coding and IDCT decoding. 

However, research generally focuses on specific techniques, such as the 
above-mentioned techniques that used DCT coding to provide the desired degree of 
compression. Nevertheless, other transforms may be used to provide certain 
advantages under certain circumstances. For example, in the DCT compression 
coding method discussed above, an input image is divided into many uniform blocks 
and the two-dimensional cosine transform function is applied to each block to 
transform the data samples into a set of transform coefficients to remove the spatial 
redundancy. However, even though a high compression rate may be attained, a 
blocking effect, which may be subtle or obvious, is generated. Further, vector 
quantization methods that may be utilized by the compression system are 
advantageous due to their contribution to the high compression rate. On the other 
hand, a sub-band method may reduce the blocking effect which occurs during high 
rates of data compression. The wavelet transform (WT) or Sub-Band Coding (SBC) 
methods encode signals based on, for example, time and frequency components. 
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As such, these transform methods can be useful for analyzing non-stationary signals 
and have the advantage that they may be designed to take into account the 
characteristics of the human visual system (HVS) for image analysis. 

Scaled terms may be used to replace multiplicative constants like cosine 
terms in a Discrete Cosine Transform (DCT) with a minimum number of 
additions/subtractions. However, the scaled terms merely approximate the 
constants in the transform equations. Thus, some error is accepted to keep the 
precision confined to a fixed number of bits or to minimize the number of operations. 
If the resulting numbers are further from a decision boundary (e.g., a threshold value 
or a quantization boundary) than the maximum possible error, the result will not be 
affected by the approximations. However, the resulting numbers may be 
determined, during the incremental calculations, to require additional precision. Yet, 
the original input values are no longer available in the registers, and refetching the 
original input values from memory can impose cycles associated with cache misses 
and memory latency. The brute-force option is to perform an inverse transform (e.g., 
an I DCT) on the values, and then re-run the forward transform (e.g., FDCT, 
sometimes denoted just DCT) with higher precision. The disadvantage of the brute 
force approach is that operations are wasted. 

It can be seen then that there is a need to provide faster transforms that use 
early aborts and precision refinements to save processing cycles thereby providing 
faster transform calculations and decreased execution times. 
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SUMMARY OF THE INVENTION 
To overcome the limitations in the prior art described above, and to overcome 
other limitations that will become apparent upon reading and understanding the 
present specification, the present invention discloses faster transforms that use early 
5 aborts and precision refinements. 

The present invention solves the above-described problems by detecting 
when to perform a corrective action based upon testing the incremental calculations 
of transform constants and performing the corrective action: refining the incremental 
calculations to obtain additional precision and/or aborting the incremental 
10 calculations when the resulting number is going to be too small. Those skilled in the 
art will recognize that throughout this specification, the term "matrix" is used in both 
its traditional mathematical sense and also to cover all hardware and software 
systems which when analyzed could be equivalently represented as a mathematical 
matrix. 

1 5 A method in accordance with the principles of the present invention includes 

testing at least one number resulting from an incremental calculation of transform 
coefficients during a transform, determining whether to perform a corrective action 
based upon the testing and performing the corrective action when a corrective action 
is determined to be needed. 

20 Other embodiments of a method in accordance with the principles of the 

invention may include alternative or optional additional aspects. One such aspect of 
the present invention is that the determining comprises detecting whether the 
incremental calculation of the transform coefficients will result in transform 
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coefficients with unacceptable precision and the performing corrective action 
comprises refining the at least one number. 

Another aspect of the present invention is that the transform comprises a 
transform matrix and wherein the refining comprises applying a refinement matrix for 
5 increasing precision of the incremental calculation of the transform constants. 

Another aspect of the present invention is that the refinement matrix 
comprises /+„D m+1 D~ l . 

Another aspect of the present invention is that the method further includes 
generating at least one refinement matrix based on approximately calculated 
1 0 transform constants. 

Another aspect of the present invention is that the generating at least one 
refinement matrix is performed offline or at initialization. 

Another aspect of the present invention is that the generating the refinement 
matrix comprises recognizing that an approximate transform is invertible, generating 
15 the refinement matrix given by I+ d D m+l D~ l , and structuring the transform for 

efficient computation. 

Another aspect of the present invention is that the generating the refinement 
matrix includes recognizing that recovery of the nth column of a transform matrix for 
generating the transform is impossible, calculating a pseudo inverse for a portion of 
20 the transform matrix and generating an approximation for the refinement matrix 
using the pseudo inverse for the transform matrix. 
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Another aspect of the present invention is that the approximation of the 
refinement matrix comprises I+ d D u D Q . 

Another aspect of the present invention is that the determining further 
comprises determining whether an error resulting from terminating the incremental 
5 calculation is acceptable and the performing corrective action comprises aborting the 
incremental calculation of a transform coefficient. 

Another aspect of the present invention is that the incremental calculation is 
terminated when a determination is made that the incremental calculation will result 
in a number that is projected to be within a predetermined range. 
1 0 Another aspect of the present invention is that the number that is projected to 

be within a predetermined range comprises a transform coefficient that does satisfy 
a precision requirement. 

Another aspect of the present invention is that the incremental calculation is 
terminated when a refinement to the transform coefficient is determined not to 
15 change the result. 

Another aspect of the present invention is that a refinement to the transform 
coefficient is determined not to change the result when, after checking the relative 
magnitudes of the results of the incremental calculations, an intermediate calculation 
of at least one transform coefficient is small compared to the intermediate calculation 
20 of another transform coefficient. 

Another aspect of the present invention is that a refinement to the transform 
coefficient is determined not to change the result when, after checking the 
magnitude of the results of at least one incremental calculation, at least one 
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intermediate calculation of the transform coefficient is less than a predetermined 
threshold. 

Another aspect of the present invention is that the determining further 
comprises determining that a transform coefficient is going to be within a 
predetermined range of zero and the performing corrective action comprises 
aborting the incremental calculation of the transform coefficient. 

In another embodiment of the present invention, a data compression system 
is provided. The data compression system includes a transformer for applying a 
linear analysis transform to decorrelate data into transform coefficients using 
transform equations, the transformer reducing errors of the transform by testing at 
least one number resulting from an incremental calculation of transform coefficients 
during a transform, determining whether to perform a corrective action based upon 
the testing and performing the corrective action when a corrective action is 
determined to be needed. 

In another embodiment of the present invention, a printer is provided. The 
printer includes memory for storing image data, a processor for processing the 
image data to provide a print stream output and a printhead driving circuit for 
controlling a printhead to generate a printout of the image data, wherein the 
processor reduces errors of the transform by testing at least one number resulting 
from an incremental calculation of transform coefficients during a transform, 
determining whether to perform a corrective action based upon the testing and 
performing the corrective action when a corrective action is determined to be 
needed. 
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In another embodiment of the present invention, an article of manufacture is 
provided. The article of manufacture includes a program storage medium readable 
by a computer, the medium tangibly embodying one or more programs of 
instructions executable by the computer to perform a method for reducing errors 
5 during data processing, the method including testing at least one number resulting 
from an incremental calculation of transform coefficients during a transform, 
determining whether to perform a corrective action based upon the testing and 
performing the corrective action when a corrective action is determined to be 
needed. 

10 In another embodiment of the present invention, a data analysis system is 

provided. The data analysis system includes transform equations formed by testing 
at least one number resulting from an incremental calculation of transform 
coefficients during a transform, determining whether to perform a corrective action 
based upon the testing and performing the corrective action when a corrective action 

15 is determined to be needed and a transformer for applying the transform equations 
to perform a linear transform to decorrelate data into transform coefficients. 

These and various other advantages and features of novelty which characterize 
the invention are pointed out with particularity in the claims annexed hereto and form a 
part hereof. However, for a better understanding of the invention, its advantages, and 

20 the objects obtained by its use, reference should be made to the drawings which form 
a further part hereof, and to accompanying descriptive matter, in which there are 
illustrated and described specific examples of an apparatus in accordance with the 
invention. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Referring now to the drawings in which like reference numbers represent 
corresponding parts throughout: 

Fig. 1 illustrates a typical image compression system; 
5 Fig. 2 illustrates a flow chart of a method for providing faster transforms using 

scaled terms; 

Fig. 3 illustrates a flow chart for providing faster transforms using corrective 
action to provide faster transform calculations and decreased execution times; 

Fig. 4 illustrates a flow chart of the abort method according to the present 
1 0 invention that demonstrates aborting further iterations of the transform coefficient 
calculation process; 

Fig. 5 is illustrates the testing of the at least one incrementally calculated 
number; 

Fig. 6 is a flow chart of the refinement method according to the present 
15 invention; 

Fig. 7 illustrates a flow chart of a first method for generating a refinement 

matrix; 

Fig. 8 is a flow chart showing a second method for generating a refinement 
matrix when d D 0 is not invertible; 
20 Fig. 9 illustrates a printer according to the present invention; 

Fig. 10 illustrates a data analyzing system according to the present invention; 

and 
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Fig. 1 1 illustrates another data analyzing system according to the present 



invention. 
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DETAILED DESCRIPTION OF THE INVENTION 
In the following description of the exemplary embodiment, reference is made 
to the accompanying drawings which form a part hereof, and in which is shown by 
way of illustration the specific embodiment in which the invention may be practiced. 
5 It is to be understood that other embodiments may be utilized as structural changes 
may be made without departing from the scope of the present invention. 

The present invention provides faster transforms that use early aborts and 
precision refinements. Faster transforms are obtained by detecting when to perform 
a corrective action based upon testing the incremental calculations of transform 
10 coefficients and performing the corrective action: refining the incremental 
calculations to obtain additional precision and/or aborting the incremental 
calculations when at least one resulting number is going to be too small. 

Ficf.J^ttqgffates alVp'fc'an^ 100. The data_ 

compression system may include three closelvgpRfrgSfed components namely (a) 
15 Transformer 120, (b) Quantizer 1^Pt^nd(c) Optional Entropy Encoder 140. 
Compression is accomoliaffed by applying a linear transform to decorrelate the 
image data 1 10j^dantizing the resulting transform coefficients, and, if desired, 
entropy coding the quantized values. A variety of linear transforms have been 
developed which include Discrete Fourier Transform (DFT), Discrete Cosine 
20 Transform (DCT), Discrete Wavelet Transfolm (DWT) and many more, each with its 




advantages and disadvani 
The quantizer 130 simply reduces the number of bits needed to store the 
transformed coefficients by reducing the precision of those values. Since this is a 
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many-to-one mapping, it is a lossy process and is the significant source of 
compression in such an encoder. Quantization can be performed on each individual 
coefficient, which is known as Scalar Quantization (SQ). Quantization can also be 
performed on a collection of coefficients together, and this is known as Vector 
Quantization (VQ). Both uniform and non-uniform quantizers can be used 
depending on the problem at hand. 

Wte-Opriunai entropy encoder 140 further compresses Trrei^raRtized^y^ — 
losslessly to give better overall compression. It maiu*se~cfmodel to accurately 
determine the probabilities for eacj>qa3ntized value and produces an appropriate 
code based on thesepfcJ^abilities so that the resultant output code stream will be 
smaller thap>tffe input stream. The most commonly used entropy encoders are the 
HuffjTt^m encoder and t he arithmetic e ncoder, although for applications requiring fast 
^eeafiSn, simple run-length encoding (RLE) hasT5iisveA^ 

The term image transforms usually refers to a class of unitary matrices used 
for representing images. This means that images can be converted to an alternate 
representation using these matrices. These transforms form the basis of transform 
coding. Transform coding is a process in which the coefficients from a transform are 
coded for transmission. 
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The signal F(x) is a function mapping each integer from 0..n - 1 into a 
complex number. An example is given by a line of a sampled or pixelated image, 
where the samples or pixels are equally spaced. An "orthogonal basis" for a 

collection of such F(x) is a set {b y (x)} n ^ o of functions, where ^b y (x)b z (x) = 0 for y * 

x=0 
n-l 

5 z. A "transform" of F(x) % denoted F{y) , is given by F(y) = T F(x)bJx) . Transforms 

~i y 

of this type are used in many signal and image processing applications to extract 
information from the original signal F. One example of a transform is the discrete 
Fourier transform (DFT), where b y (x) = exp (27iixy/n). A related example is the 
discrete cosine transform (DCT), where b y (x) = cos (27txy/n) t Another example is the 

10 wavelet transform, where b y (x) is a particular scaled and offset version of the mother 
wavelet function. (See, Ingrid Daubechies, Ten Lectures on Wavelets , Society for 
Industrial & Applied Mathematics, (May 1992)). 

The theoretical basis for the independent scaling operations will now be 
demonstrated by showing the mathematical basis for being able to perform the 

1 5 scales without destroying the structure of the transform. Define a transform 

n-l 

F(y) = ^F(x)b y (x) . Consider those cases (described below) when the b y (x) are 

jc=0 

such that this transform can be split into two or more disjoint sums, regardless of the 
structure of F(x). (The term "disjoint", when used herein in reference to the sets of 
equations, means that there are no transform coefficients in common between 
20 equations in the two disjoint sets of equations.) For example, if b 2y (x) have even 
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symmetry, and b 2y+ i(x) have odd symmetry, it is known from mathematics that any 
F(x) can be written uniquely as F(x) = F e (x) + F 0 (x), where F e (x) is even (symmetric 
about zero) and F 0 (x) is odd (anti-symmetric about zero), and that 
J F e {x)b 2y _^ (x) = F 0 (x)b 2y (x) = 0 . This enables the transform to be written 

x x 

5 equivalently as: 

L<«-l)/2j [n/2j 

Hy) = S F e (x)b 2y (x) + X f o (*) Vi W 

y=0 y=\ 

Fig. 2 illustrates a flow chart 200 of a method for providing faster transforms 
using scaled terms. In Fig. 2, transform equations are split into at least one sub- 
transform having at least two transform constants 210. The term "sub-transforms", 

10 as used herein, references the collection of equations used to generate a subset of 
the transformed terms, where the subset may contain all of the transformed terms, 
or fewer that the total number of transformed terms. Next, the transform constants 
for each collection are scaled independently of the other collections with a scaling 
term to maintain a substantially uniform ratio between the transform constants within 

15 the collection, wherein the scaling term may be chosen according to a 

predetermined cost function 220. The result is the transform equations for 
transforming the block. Then, data is separated into at least one block 230. The 
block is then transformed into transformed data using the transform equations 240. 
Referring to the quantizer 130 of Fig. 1 , the transformed data may then be quantized 

20 by incorporating the scaling into the quantization. Choosing the scaled term for the 
constants requires the use of a cost function that represents the needs of the target 
system. 
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Scaled terms may be used to replace multiplicative constants like cosine 
terms in a Discrete Cosine Transform (DCT) with a minimum number of 
additions/subtractions. For a 1-D DCT, an 8 x 1 input vector F may be multiplied by 

an 8 x 8 transform matrix D: F = DF . In the case of the 2-D DCT, the input vector F 
5 is replaced with an 8 x 8 matrix F, and the DCT is performed as DFD ' , where D ' is 

n 

the transpose of D. Put D = ^ d D k . where d D k , the "detail transform", is the kth 

m ^ 

refinement to an approximation to D. Put D m =^ d D k and F m = D m F ; i.e., the mth 

□ approximation to F . Those skilled in the art will recognize that the 8x8 matrix of 

m input samples and corresponding 8x8 transform matrix could be replaced with 

=F 10 N1xN2 matrix of input samples using NixNi transform on the left and N 2 xN 2 
Jt; transform on the right. 

j\ However, because the scaled terms merely approximate the constants in the 

nj transform equations, some error is accepted to keep the precision confined to a fixed 

yd 

O number of bits or to minimize the number of operations. If the resulting numbers are 

15 further from a decision boundary (e.g., a threshold value or a quantization boundary) 
than the maximum possible error, the result will not affected by the approximations. 
Nevertheless, faster transforms may then be obtained by refining the incremental 
calculations to obtain additional precision if the resulting numbers are determined 
during the incremental calculations to require additional precision. 
20 Fig. 3 illustrates a flow chart 300 for providing faster transforms using 

corrective action to provide faster transform calculations and decreased execution 
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times. At least one number resulting from an incremental calculation using 
transform constants in a transform is tested 310. Then, based upon the testing, 
when to perform a corrective action is determined 320. Once it is determined that a 
corrective action is to be performed, the corrective action is performed 330. 
5 A first example of refinement occurs when each d D k adds an at least one 

additional bit of precision to the transform performed by D. A second example 

occurs when at least one element of the transform vector, F , is assumed to be very 
small, so that an entire row of JD k may be approximated as zero, enabling us to skip 
the calculation of that at least one element of F. 
10 In the first example, it is often the case that all of the D k axe invertible; i.e., a 

matrix D k exists such that D k D k = D k 1 D k = /, the identity matrix, which has ones on 
the upper left to lower right diagonal, and zeros elsewhere. In this case, it may be 
noted that 

K+i =D m+l D'jF m =(l+ d D m+x D- m % 
15 (where I is the identity matrix); i.e., the additional step of precision is provided by 
performing one more step of transform to the transformed coefficients. Using this 
additional step transform to add the precision is the first embodiment of refinement 
provided by this invention, since it saves performing both IDCT and subsequent 
DCT: the matrix for (/ -f d D m+l D~ l ) can be calculated ahead of time as a matrix R m+11 

20 so thatF m+1 = R m+l F m , a single-step transform on F m . 

The second example of refinement requires a different approach. Consider a 
specific example, where the 2-D transform has already been performed with high 
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precision in the first dimension, FD', d D 0 has its 8th row zero, an6 d D 1 = D- d D 0 . 
Then d D 0 is not invertible; i.e., there is no way to recover the original 8 columns of 
FD 1 from d D 0 FD ' (this follows from the fact that finding FD' from d D 0 FD ' may be 
viewed as 7 equations in 8 unknowns). However, if an assumption is made for one 

of the 8 columns of FD ' , then the other 7 columns can be estimated from cA>FD ' » 
contingent on the assumption for the 8th column. A reasonable assumption is that 
the 8th column contains small elements that may be approximated as zero, since the 
higher-numbered transformed values tend to be less significant in real images than 
the lower-numbered transformed values. Then d D 0 may be treated as an 8 x 7 

matrix (ignoring the zero row), the pseudo-inverse, d D G , (as is well-known in the 

literature) is found by 



with an 8th row of zeros inserted for the assumed 8th coefficients. This gives an 8 x 
8 approximation for d^" 1 ' so that we can approximate 



This approximate refinement is the second embodiment of the refinement 
invention, which saves the cycles of the IDCT followed by the DCT, as in the first 
example. 

The abort procedure is used to determine when a calculation can be 
terminated before its completion to save cycles, when the result of the calculation is 
projected to be too small, so that it wilt be quantized to zero. One example of the 
application of the abort procedure appears in example 2 above, where at least one 
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low-magnitude transform coefficient is not calculated, being essentially equivalent to 
setting the corresponding row or rows of the transform matrix to zero. Another 
example is stopping a calculation with limited precision, when additional transform 
precision is projected to provide negligible additional information in the transformed 
5 values; e.g., when the result of the calculation is projected to be small. An 

alternative method involves testing the magnitude of the sums and/or differences of 
some of the inputs to the transform. For example, for the FDCT, the following 
equation calculates the second transform coefficient: 

1 0 25 (2) = C 2 d 01M + C 6 d l625 

where d mA = s 07 -s 34 and d m5 = s [6 -s 25 , notation from Pennebaker and Mitchell's 

JPEG text. The magnitudes of these values can be tested for impact on subsequent 
processing of the transform coefficients. In this example, if S(2) is less than the 
15 magnitude of Q/2 (where Q is a quantization value for S(2)), then S(2) will be 

quantized to zero. This translates to a test of whether d m4 is less than Q/(2C 2 ) in 

magnitude and d l625 is less than Q/(2C 6 ) in magnitude. If this test is met, then the 

calculation for S(2) can be aborted, and S(2) set to its quantized value of zero. This 
method of testing sums and/or differences of the input values can be extended to all 
20 of the equations for the FDCT. 

It is not obvious how to turn a comparison such as -f < F <f (a term-by-term 

range check for the members of vector or matrix F ), where the elements of f are 
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all non-negative, into a term-by-term comparison of the elements of F, -T< F< 7", 
where the elements of T are all non-negative, and where satisfying the test on F is 
sufficient to satisfy the test on F . The difficulty arises from the fact that the DCT 
employs both positive and negative operations, which destroys the term-by-term 

5 ordering in the equation. Specifically, it cannot be said that -f<F<f implies 

that- zr l f <F <D~ x f . 

Thus, the abort involves terminating the precision of an operation when 
additional transform precision is projected to have an acceptable or negligible effect 
on the results of the subsequent processing operations, e.g., quantization or 
10 comparison. For example, the coefficients of the DCT can be scaled by an integer 
and approximated as sums of powers of 2. For the odd terms, one of these 
approximations is as follows: 
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20 



which we write as (using the notation from above), 

4W = 2 5 d D 0 + 2 3 d D l +2 l d D 2 + 2° d D 3 
Also, as mentioned above, all of the matrices above, and their sequential 
sums, are invertible. Now, if jF « 1 , i.e., the jth element of F is very small in 

magnitude, then j (32 d D 0 F) should be small. If it is not, then j ((8 d D, +2 d D 2 + d DJ 
F) will not be able to cancel it out to make the final result small. The relative 

Page 21 

BLD9-2000-0064US01 

ALG 501.379US01 
Patent Application 




magnitudes of the results of the calculations may be checked. If one of the 
transform values, for one of the intermediate precisions, is small compared to the 
other values, or is small compared to some pre-determined threshold, then 
subsequent refinements for that transform value can be aborted. 
5 Fig. 4 illustrates a flow chart 400 of the abort method according to the present 

invention that demonstrates aborting further iterations of the transform coefficient 
calculation process. In Fig. 4, at least one incrementally calculated number is tested 
410. If certain criteria are met, further calculations are aborted 420. The 
incremental calculation of a transform coefficient may be aborted when an error 

10 resulting from terminating the incremental calculation is acceptable. For example, 
the incremental calculation may be terminated when a determination is made that 
the incremental calculation will result in a number that is projected to be within a 
predetermined range, e.g., a transform coefficient that does satisfy a precision 
requirement. Alternatively, the incremental calculation of the transform coefficient 

15 may be aborted when a transform coefficient is going to be within a predetermined 
range of zero. 

Fig. 5 is a flow chart 500 of the testing of the at least one incrementally 
calculated number. In Fig. 5, the incremental calculation is tested to determine 
when a refinement to the transform coefficient will not change the result 510. This 
20 testing may be carried out in at least two ways as shown in Fig. 5. A refinement to 
the transform coefficient may be determined not to change the result when, after 
checking the relative magnitudes of the results of the incremental calculations, an 
intermediate calculation of at least one transform coefficient is small compared to the 
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intermediate calculation of another transform coefficient 520. Alternatively, a 
refinement to the transform coefficient may be determined not to change the result 
when, after checking the magnitude of the results of at least one incremental 
calculation, at least one intermediate calculation of the transform coefficient is less 
5 than a predetermined threshold 530. 

Fig. 6 is a flow chart 600 of the refinement method according to the present 
invention. First, a determination is made whether the transform requires more 
precision 610. The transform is a transform matrix, wherein a refinement matrix may 
be used to increase precision of the incremental calculation of the transform 

10 coefficients. When more precision is required, a refinement matrix is applied to the 
transform 620. The refinement matrix is generated offline or at initialization and is 
based on approximately calculated transform constants. 

Fig. 7 illustrates a flow chart 700 of a first method for generating a refinement 
matrix. First, it is recognized that an approximate transform is invertible 710. The 

15 refinement matrix given by /+ rf £> m+1 D~ l is generated 720. Then, the transform is 

structured for efficient computation 730. 

However, as described above, when d Do is not invertible; there is no way to 
recover the original 8 columns of FD' from d D 0 FD 1 . Fig. 8 is a flow chart 800 
showing a second method for generating a refinement matrix when d D 0 is not 
20 invertible. It is first recognized that recovery of the nth column of a transform matrix 
for generating the transform is impossible 810. A pseudo inverse for a portion of the 
transform matrix is calculated 820. Then, an approximation for the refinement matrix 
is generated using the pseudo inverse for the transform matrix 830. The 
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approximation of the refinement matrix comprises l+ d D u D 0 . 

Fig. 9 illustrates a block diagram 900 of a printer 920 according to the present 
invention. In Fig. 9, the printer 920 receives image data 912 from a host processor 
910. The image data 912 is provided into memory 930 where the image data may 
be arranged into NixN 2 block samples. The N1XN2 block samples are then 
processed by a processor 940, such as a raster image processor. The raster image 
processor 940 provides a compressed print stream representing the image data to a 
printhead driving circuit 950. The printhead driving circuit 950 then controls the 
printhead 960 to generate a printout 970 of the image data. 

The process illustrated with reference to Figs. 1-3 may be tangibly embodied 
in a computer-readable medium or carrier 990, e.g. one or more of the fixed and/or 
removable data storage devices illustrated in Fig. 9, or other data storage or data 
communications devices. The computer program may be loaded into the memory 
992 to configure the processor 940 of Fig. 9, for execution. The computer program 
comprises instructions which, when read and executed by the processor 940 of Fig. 
9, causes the processor 940 to perform the steps necessary to execute the steps or 
elements of the present invention. 

Fig. 10 illustrates a data analyzing system 1000 according to the present 
invention. In Fig. 10, a transformer 1010 receives a block of data 1012 to be 
analyzed. The transformer 1010 uses transform equations 1020 to generate 
transformed data 1024. Transform equations 1020 are split into at least one sub- 
transform having at least two transform constants. The at least two transform 
constants for each collection are scaled independently of the other collections with a 
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scaling term to maintain a substantially uniform ratio between the at least two 
transform constants within the at least one collection, wherein the scaling term is 
chosen according to a predetermined cost function. The transformed data 1024 may 
then be quantized by an optional quantizer 1 030. 
5 Fig. 1 1 illustrates another data analyzing system 1 1 00 according to the 

present invention. In Fig. 1 1 , a transformer 1110 receives a block of data 1 1 1 2 to be 
analyzed. The transformer 1110 uses transform equations 1 120 to generate 
transformed data 1 124. Transform equations 1 120 are split into at least one sub- 
transform having at least two transform constants. The at least two transform 
^ 10 constants for each collection are scaled independently of the other collections with a 
5j scaling term to maintain a substantially uniform ratio between the at least two 

=p transform constants within the at least one collection, wherein the scaling term may 

W be chosen according to a predetermined cost function. The transformed data 1 124 

jr may then be compared to scaled comparison values in an optional comparator 1 130. 

; : t 1 5 The foregoing description of the exemplary embodiment of the invention has 

P been presented for the purposes of illustration and description. It is not intended to 

be exhaustive or to limit the invention to the precise form disclosed. Many 
modifications and variations are possible in light of the above teaching. It is 
intended that the scope of the invention be limited not with this detailed description, 
20 but rather by the claims appended hereto. 
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